Exciton-plasmon states in nanoscale materials: breakdown of the Tamm-Dancoff approximation
Within the Tamm-Dancoff approximation, ab initio approaches describe excitons as packets of
electron-hole pairs propagating only forward in time. However, we show that in nanoscale materials
excitons and plasmons hybridize, creating exciton-plasmon states where the electron-hole pairs
oscillate back and forth in time. Then, as exemplified by the trans-azobenzene molecule and carbon
nanotubes, the Tamm-Dancoff approximation yields errors as large as the accuracy claimed in ab
initio calculations. Instead, we propose a general and efficient approach that avoids the
Tamm-Dancoff approximation and correctly describes excitons, plasmons and exciton-plasmon states.

PACS numbers: 73.20.Mf,78.67.-n,71.15.Qo,31.15.-p 



The Bethe-Salpeter (BS) [1] and the Time-Dependent Density Functional Theory
(TDDFT) [2] equations allow the accurate calculation of the polarization function
of many physical systems without relying on external parameters. Within these
frameworks neutral excitations are described as combinations of electron-hole
(e-h) pairs of a noninteracting system. However, for nanoscale materials the huge
number of e-h pairs involved makes the solution of the BS/TDDFT equation extremely
cumbersome. Consequently, the increasing interest in the excitation properties of
such materials has justified the use of ad hoc approximations. The most important
and widely used is the Tamm-Dancoff approximation (TDA) [3], where only
positive-energy e-h pairs are considered. Within the TDA the interaction between
e-h pairs at positive and negative (antipairs) energies is neglected, and only one
e-h pair is assumed to propagate in any time interval. The main advantage of the
TDA is that the non-Hermitian BS/TDDFT problem reduces to a Hermitian problem,
which can be solved with efficient and stable iterative methods.

In Solid State Physics, a major field of application of the BS/TDDFT equation,
the success of the TDA is based on the sharp distinction between excitonic and
plasmonic excitations. Excitons are localized packets of e-h pairs bound together
by the Coulomb attraction and are observed in optical absorption experiments.
Plasmons are, instead, delocalized collective oscillations of the electronic
density that induce a macroscopic polarization effect and are observed in electron
energy loss (EEL) experiments. In contrast to the case of excitons, the TDA is
known to misdescribe plasmons in solids [5], as the density oscillations involve
the excitation of e-h antipairs. Nevertheless, the success in describing optical
absorption of solids and the remarkable numerical advantages have motivated the
application of the TDA to very different systems. Nowadays the BS/TDDFT equation
within the TDA is becoming a standard tool to study excitations in nanostructures
[6, 7] and in molecular systems [8].

In this Letter we argue that for confined systems, such as nanostructures or
π-conjugated molecules, the excitations appearing in the response function show a
mixed excitonic-plasmonic behavior. As a consequence the e-h pair-antipair
interaction becomes crucial and the TDA no longer holds. A paradigmatic example
is the trans-azobenzene molecule, where the TDA underestimates the static
polarizability by ~40%. This error is larger than the claimed accuracy of ab
initio calculations. Even more intriguing is the case of carbon nanotubes that,
because of their quasi-one-dimensional (1D) structure, behave either as an
extended or as an isolated system depending on the direction of the perturbing
field. Thus, for transverse perturbations the excitons acquire a plasmonic nature
and the TDA overestimates the position of the π plasmon peak appearing in both
absorption and EEL spectra by almost 1 eV. By exploiting the symmetry properties
of the BS and TDDFT kernels we devise a robust and efficient iterative approach to
calculate the frequency-dependent response beyond the TDA. This approach benefits
from the same numerical advantages as Hermitian techniques, and correctly
describes excitons, plasmons and exciton-plasmon states.

To introduce and understand the reasons behind the breakdown of the TDA we need to
study in detail the structure of the TDDFT and BS equations. These are commonly
rewritten as a Hamiltonian problem 0], by expanding the single-particle states in
the Kohn-Sham basis. Then, the BS/TDDFT Hamiltonian $H$ is a matrix in the Fock
space of the e-h pairs $|eh\rangle$ and antipairs $|he\rangle$, and it has the
block form

$$H = \begin{pmatrix} R & C \\ -C^* & -R^* \end{pmatrix}. \qquad (1)$$

The resonant block $R$ is Hermitian and the coupling block $C$ is symmetric (see
Appendix B of Ref. [§]). The dielectric function $\epsilon(\omega)$ is written in
terms of the resolvent of $H$, $(\omega - H)^{-1}$, as
$\epsilon(\omega) = 1 - (8\pi/\Omega)\,\langle P|(\omega - H + i0^+)^{-1}|P\rangle$,
where $\Omega$ is the simulation volume. In the limit of large $\Omega$ the
polarizability is given by $\alpha(\omega) \propto \epsilon(\omega)$. $|P\rangle$
is a ket whose components along the $|eh\rangle$ space are the optical
oscillators: $\langle P|eh\rangle \sim \langle e|\mathbf{d}\cdot\boldsymbol{\xi}|h\rangle$,
with $\mathbf{d}$ the electronic dipole and $\boldsymbol{\xi}$ the light
polarization factor.
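As a minimal numerical sketch of the resolvent expression for the dielectric function, the following Python code assembles $H$ from given $R$ and $C$ blocks, assuming the block form $H = [[R, C], [-C^*, -R^*]]$, and evaluates $1 - (8\pi/\Omega)\langle P|(\omega - H + i\eta)^{-1}|P\rangle$ with a dense linear solve. The function names, the finite broadening `eta` standing in for $0^+$, and the treatment of $|P\rangle$ as a vector supplied in the full pair/antipair space are assumptions made for illustration; they are not taken from the authors' implementation.

```python
import numpy as np

def build_full_hamiltonian(R, C):
    """Assemble H = [[R, C], [-C*, -R*]] (assumed block form)."""
    R = np.asarray(R, dtype=complex)
    C = np.asarray(C, dtype=complex)
    return np.block([[R, C], [-C.conj(), -R.conj()]])

def dielectric_function(H, P, omega, volume, eta=0.05):
    """Return 1 - (8*pi/volume) * <P|(omega + i*eta - H)^(-1)|P>."""
    P = np.asarray(P, dtype=complex)
    z = omega + 1j * eta                       # eta is a finite stand-in for 0^+
    x = np.linalg.solve(z * np.eye(H.shape[0]) - H, P)
    return 1.0 - (8.0 * np.pi / volume) * np.vdot(P, x)   # vdot conjugates P
```

A dense solve of this kind scales cubically with the number of e-h pairs, so it is only practical for small model problems.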
FIG. 1: Dynamical polarizability of the trans-azobenzene molecule calculated
within the TDLDA either by using the full Hamiltonian (solid line) or the TDA
(dashed line). For comparison, the results obtained by diagonalization (circles)
are also reported. The TDA largely underestimates the static polarizability, by
almost 40%. More importantly, the amplitude function (inset) of the most intense
peak of the polarizability (see text) differs dramatically in the TDA and in the
full Hamiltonian. The TDA misses an important contribution from the spatially
extended e-h antipair with energy $\sim -2.7$ eV.

The matrix elements of $R$ and $C$ have different definitions in the BS and TDDFT
Hamiltonians. Within the BS equation
$R_{ee',hh'} = E_{eh}\,\delta_{ee'}\delta_{hh'} + \langle eh|W - 2V|e'h'\rangle$
and $C_{ee',hh'} = \langle eh|W - 2V|h'e'\rangle$. $E_{eh}$ is the energy of the
independent e-h pair, $W$ is the statically screened e-h attraction and $V$ is
the bare Coulomb interaction without the long-range tail [§]. Within TDDFT $W$ is
replaced by $-2f_{xc}$, with $f_{xc}$ the exchange-correlation kernel.

The TDA assumes that the effect induced by the e-h antipairs is negligible.
Consequently, the $C$ block, which couples pairs and antipairs, is neglected, and
the Hamiltonian $H$ is approximated by $R$. The coupling described by $C$ is
dominated by the contribution arising from the bare Coulomb interaction $V$. This
term, called the Hartree contribution, measures the degree of inhomogeneity of the
electronic density. The larger this inhomogeneity, the stronger the corresponding
polarization and the induced local fields that counteract the external perturbing
field. Therefore, the inhomogeneity of the electronic density and the strength of
local fields determine whether an excitation is well described by the TDA.
Furthermore, the TDA is known to fail for plasmonic excitations [5] that, causing
an oscillation of the density, involve the creation of e-h pairs and antipairs.
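The effect of dropping $C$ can be illustrated with a toy model. The sketch below, written in Python with hypothetical function names, compares the positive excitation energies of the full Hamiltonian, assuming the block form $H = [[R, C], [-C^*, -R^*]]$, with the eigenvalues of $R$ alone (the TDA). For a single e-h pair with real $R = r$ and $C = c$, the full problem gives $\pm\sqrt{r^2 - c^2}$, so the TDA energy $r$ is blueshifted by an amount that grows with the coupling.

```python
import numpy as np

def excitation_energies(R, C):
    """Return (full, tda): positive eigenvalues of H and eigenvalues of R."""
    R = np.asarray(R, dtype=complex)
    C = np.asarray(C, dtype=complex)
    H = np.block([[R, C], [-C.conj(), -R.conj()]])
    eigs = np.linalg.eigvals(H).real      # spectrum assumed real (stable system)
    full = np.sort(eigs[eigs > 0])        # keep the positive-energy branch
    tda = np.linalg.eigvalsh(R)           # Hermitian problem, ascending order
    return full, tda

# Single e-h pair: the TDA energy is r, the full one is sqrt(r**2 - c**2).
full, tda = excitation_energies([[5.0]], [[3.0]])
```

The numbers are arbitrary; a realistic calculation would take $R$ and $C$ from the BS or TDDFT kernel.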

The electronic density of confined systems, like molecules and nanostructures, is
typically strongly inhomogeneous. Moreover, the excitons can be spread all over
the molecule, involving the excitation of most of the electrons. Thus, in contrast
to solids, it is not possible to distinguish between excitonic and plasmonic
excitations, and the arguments commonly used to justify the TDA fail.

Indeed, the striking failure of the TDA is clearly demonstrated by the dynamical
polarizability $\Im[\alpha(\omega)]$ of trans-azobenzene, calculated within the
time-dependent local density approximation (TDLDA) [10], [11]. In Fig. 1 we
compare $\Im[\alpha(\omega)]$ calculated either using the TDA or by solving the
full $H$ eigenproblem. The TDA yields a blueshift of 0.2 eV and a large
overestimation of the intensity of the main peak. More importantly, the TDA causes
a 40% underestimation of the static polarizability, $\Re[\alpha(\omega = 0)]$.
The reason for this failure can be understood by looking at the amplitude function
$A_\lambda(\omega)$ of the eigenstates $|\lambda\rangle$ of $H$, defined as
$A_\lambda(\omega) = \sum_{n=\{eh\},\{he\}} |\langle n|\lambda\rangle|^2\,\delta(\omega - E_n)$.
The $A_{\lambda_{max}}$ function for the state $\lambda_{max}$ corresponding to
the most intense peak of the $\alpha$ spectrum is shown in the inset of Fig. 1. In
the TDA, the $\lambda_{max}$ state is decomposed only into positive-energy e-h
pairs. However, the solution of the full Hamiltonian reveals an important
contribution from the e-h antipair with energy $\sim -2.7$ eV. This term,
neglected in the TDA, causes a dramatic overall redistribution of weights,
increasing the contribution of e-h pairs at high energies. Thus, the contribution
of the e-h antipair, corresponding to a spatially extended $\pi^* \to \pi$
transition, changes the character of the excitation. Whereas in the TDA the most
intense excitation in the spectrum can be identified with a single e-h pair, in
the full $H$ solution it acquires a more collective, plasmon-like character.
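A minimal sketch of how the amplitude function could be evaluated numerically is given below. It assumes that the components $\langle n|\lambda\rangle$ of an eigenvector on the pair and antipair basis and the corresponding independent-particle energies $E_n$ (positive for pairs, negative for antipairs) are already available, and it replaces the delta function by a normalized Gaussian of width `sigma`. The function name and the broadening are illustrative choices.

```python
import numpy as np

def amplitude_function(components, energies, omega, sigma=0.1):
    """A_lambda(omega) = sum_n |<n|lambda>|^2 delta(omega - E_n), Gaussian-broadened."""
    weights = np.abs(np.asarray(components)) ** 2          # |<n|lambda>|^2
    energies = np.asarray(energies, dtype=float)
    omega = np.atleast_1d(np.asarray(omega, dtype=float))
    diff = omega[:, None] - energies[None, :]              # shape (n_omega, n_states)
    gauss = np.exp(-0.5 * (diff / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))
    return gauss @ weights                                 # sum over basis states n
```

In a TDA calculation all weights sit at positive $E_n$; any weight appearing at negative energies signals an antipair contribution of the kind discussed above.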

The example of trans-azobenzene makes clear that a proper description of the
electron-electron correlations in confined systems requires the solution of the
full BS/TDDFT Hamiltonians, beyond the TDA. However, for larger nanostructures
with many degrees of freedom, the size of the $H$ matrix can be as large as
$10^6 \times 10^6$, and consequently the problem can only be treated with
iterative methods. The TDA reduces $H$ to a Hermitian Hamiltonian, and makes it
possible to use the efficient and stable Hermitian iterative approaches 0].
Therefore we need an iterative approach for calculating the resolvent of the full
non-Hermitian Hamiltonian $H$ that is as efficient and stable as in the Hermitian
case.

In what follows, we show that it is indeed possible to design such an iterative
approach by observing that $H$ belongs to a class of non-Hermitian Hamiltonians
with a real spectrum, that is, with real eigenvalues. As established by
Mostafazadeh [12], the reality of the spectrum is related to the existence of a
positive-definite inner product with respect to which the Hamiltonian is
Hermitian. We show that for the BS (TDDFT) Hamiltonian, and in general for all the
Hamiltonians of this form, this inner product does exist (thus the spectrum is
real) and, more importantly, it is explicitly known. The knowledge of this product
allows one to conveniently transform the iterative approach designed for the
Hermitian TDA Hamiltonian to treat the full non-Hermitian Hamiltonian.

Following Zimmermann [13] we can write $H$ as the product of two noncommuting
Hermitian matrices,

$$H = F\bar{H} = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} \begin{pmatrix} R & C \\ C^* & R^* \end{pmatrix}. \qquad (2)$$



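One can check that $\bar{H}H = H^\dagger\bar{H}$, with $\bar{H}$ the second
(Hermitian) factor in Eq. (2). This key property of $H$ is called
$\bar{H}$-pseudo-Hermiticity [14]. Through the $\bar{H}$ operator, we can define a
positive-definite $\bar{H}$-inner product [15],
$\langle\cdot|\cdot\rangle_{\bar{H}} := \langle\cdot|\bar{H}|\cdot\rangle$, and a
corresponding $\bar{H}$-expectation value,
$\langle\cdot|O|\cdot\rangle_{\bar{H}} := \langle\cdot|\bar{H}O|\cdot\rangle$.
With respect to this $\bar{H}$-expectation value, $H$ is Hermitian, as can be
verified by using the $\bar{H}$-pseudo-Hermiticity,

$$\langle v|H|v'\rangle_{\bar{H}} = \langle v'|H^\dagger\bar{H}|v\rangle^* = \langle v'|\bar{H}H|v\rangle^* =: \langle v'|H|v\rangle^*_{\bar{H}}. \qquad (3)$$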
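The pseudo-Hermiticity relation and its consequences can be checked numerically on a small random model. The Python sketch below builds $F$ and the Hermitian factor of Eq. (2), written `Hbar`, from a Hermitian `R` and a symmetric `C`; the function names, matrix sizes and random parameters are arbitrary illustrative choices, picked so that `Hbar` is positive definite.

```python
import numpy as np

def toy_blocks(n=4, seed=0):
    """Random Hermitian R (dominant diagonal) and small complex symmetric C."""
    rng = np.random.default_rng(seed)
    A = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    B = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    R = np.diag(np.arange(2.0, 2.0 + n)) + 0.05 * (A + A.conj().T)
    C = 0.02 * (B + B.T)
    return R, C

def factorized_hamiltonian(R, C):
    """Return (F, Hbar, H) with H = F @ Hbar, following Eq. (2)."""
    n = R.shape[0]
    F = np.diag(np.concatenate([np.ones(n), -np.ones(n)]))
    Hbar = np.block([[R, C], [C.conj(), R.conj()]])
    return F, Hbar, F @ Hbar

R, C = toy_blocks()
F, Hbar, H = factorized_hamiltonian(R, C)

pseudo_hermitian = np.allclose(Hbar @ H, H.conj().T @ Hbar)   # Hbar H = H^dagger Hbar
positive_definite = np.all(np.linalg.eigvalsh(Hbar) > 0)      # Hbar defines an inner product
real_spectrum = np.allclose(np.linalg.eigvals(H).imag, 0.0, atol=1e-8)
```

If the coupling `C` is made large enough that `Hbar` loses positive definiteness, the inner product is no longer valid and the eigenvalues of `H` need not stay real.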

It follows that the $\bar{H}$-expectation value of the resolvent of $H$,
$(\omega - H)^{-1}$, is Hermitian as well. Then, to evaluate
$\langle P|(\omega - H)^{-1}|P\rangle$, and thus $\epsilon(\omega)$, we rewrite it
in terms of Hermitian $\bar{H}$-expectation values by using the completeness
relation $I = \sum_k |q_k\rangle\langle q_k|\bar{H}$:

$$\langle P|(\omega - H)^{-1}|P\rangle = \sum_k \langle P|q_k\rangle\,\langle q_k|(\omega - H)^{-1}|P\rangle_{\bar{H}}, \qquad (4)$$

where $\{|q_k\rangle\}$ is a complete basis, orthonormal with respect to the
$\bar{H}$-inner product. The $\langle q_k|(\omega - H)^{-1}|P\rangle_{\bar{H}}$
are conveniently calculated within the standard Lanczos-Haydock (LH) iterative
method [3], the same used in the TDA Hermitian case [17], provided that the
$\bar{H}$-inner product replaces the standard one. The LH method recursively
builds the $\{|q_k\rangle\}$ basis in which $H$ is represented by a
one-dimensional semi-infinite chain of sites with only nearest-neighbor
interactions. For such a system the evaluation of the matrix elements of
$(\omega - H)^{-1}$ reduces to the calculation of a continued fraction.
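The procedure described above can be sketched as follows. This is a minimal dense-matrix illustration in Python, not the authors' implementation: it runs a Lanczos recursion in which every inner product is taken as $\langle u|\bar{H}|v\rangle$, with $\bar{H}$ the Hermitian factor of Eq. (2), and then evaluates Eq. (4). For brevity it obtains the needed matrix elements by solving the small tridiagonal system rather than by writing out continued fractions, it uses a finite broadening `eta` in place of $0^+$, and it omits reorthogonalization. All names are hypothetical.

```python
import numpy as np

def lanczos_haydock(H, Hbar, P, omegas, n_iter, eta=0.1):
    """Approximate <P|(omega + i*eta - H)^(-1)|P> via Lanczos in the Hbar-inner product."""
    def inner(u, v):                          # <u|v>_Hbar = <u|Hbar|v>
        return np.vdot(u, Hbar @ v)

    P = np.asarray(P, dtype=complex)
    beta0 = np.sqrt(inner(P, P).real)         # Hbar-norm of |P>
    q_prev = np.zeros_like(P)
    q = P / beta0
    a, b, overlaps = [], [], []
    b_k = 0.0
    for _ in range(n_iter):
        overlaps.append(np.vdot(P, q))        # <P|q_k>, standard inner product
        w = H @ q
        a_k = inner(q, w).real                # diagonal chain element
        r = w - a_k * q - b_k * q_prev        # three-term recurrence
        a.append(a_k)
        b_k = np.sqrt(abs(inner(r, r).real))  # off-diagonal chain element
        if b_k < 1e-12:                       # Krylov space exhausted
            break
        b.append(b_k)
        q_prev, q = q, r / b_k

    m = len(a)
    T = np.diag(a) + np.diag(b[:m - 1], 1) + np.diag(b[:m - 1], -1)
    e0 = np.zeros(m)
    e0[0] = 1.0
    overlaps = np.array(overlaps)
    result = []
    for omega in omegas:
        # First column of (z - T)^(-1) gives <q_k|(z - H)^(-1)|q_0>_Hbar.
        column = np.linalg.solve((omega + 1j * eta) * np.eye(m) - T, e0)
        result.append(beta0 * (overlaps @ column))
    return np.array(result)
```

With `n_iter` equal to the dimension of `H` the result should coincide with a dense solve up to rounding; the point of the method is that far fewer iterations are normally enough. Since $\bar{H}v = F(Hv)$, the products with `Hbar` can in principle be folded into the product with `H`, which this sketch does not attempt.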

This approach allows us to treat systems as large as those commonly treated using
the TDA, but using the full Hamiltonian. Computationally, it requires a single
matrix-vector multiplication at each iteration, as in the Hermitian case [3].
Since $\langle P|(\omega - H)^{-1}|P\rangle$ in Eq. (4) is converged after a
number of iterations much smaller than the dimension of $H$, this method is far
more efficient than performing the diagonalization of the Hamiltonian.
Specifically, for trans-azobenzene, the number of operations performed in the
diagonalization is about two orders of magnitude larger than in our approach [19],
while the results are indistinguishable (Fig. 1).
FIG. 2: Polarized (angle $\phi$ with respect to the CNT axis) absorption (left
stack) and direction-dependent EEL (right stack) spectra of the zig-zag (8,0) CNT
calculated within the BSE either by using the full (solid line) or the TDA (dashed
line) Hamiltonian. The dotted line indicates the position of the quasiparticle
band gap. The gray area shows the experimental energy blueshift (0.5 eV) between
the parallel and transverse plasmon modes [22], correctly reproduced by full
Hamiltonian calculations. Instead, the TDA predicts the transverse plasmon mode to
redshift, in striking disagreement with the experiment.

As an application we consider the frequency-dependent response of carbon nanotubes
(CNTs) within the BS equation. Recently, experimental studies have shown CNTs to
be characterized by strongly anisotropic electronic and optical properties
[21, 22]. Moreover, the measured optical and EEL spectra have revealed that
excitations appearing at energies higher than 4 eV have a collective character
[21, 22]. Using the present method we show that the external field polarization
and the excitonic-plasmonic character of the optical excitations of CNTs are
intimately related.

So far ab initio studies have been limited to the excitonic effects appearing
below 4 eV and only for longitudinally polarized light [7]. In fact, calculations
of the optical response to transverse perturbing fields are extremely challenging,
as the plasmonic character of the excitation requires the inclusion of a huge
number of e-h pairs. Using the method proposed in this work we were able to
include up to $\sim 1.4 \times 10^5$ e-h pairs in the solution of the BS equation,
without using the TDA.

Figure 2 shows the absorption (left stack) and $q \to 0$ EEL (right stack) spectra
of CNTs calculated within the BS [id lipj l equation as a function of the angle
$\phi$ that the perturbing field forms with the tube axis. We compare the full
solution of the BS equation (solid line) with the result of the TDA (dashed line).

For a parallel perturbing field ($\phi = 0$) the CNT behaves as an extended solid.
As expected, in this case the TDA describes well the excitonic peaks in the
absorption spectrum [7]. On the contrary, the TDA completely fails in reproducing
the longitudinal plasmonic peak, overestimating its frequency by 1.5 eV. By
increasing the angle $\phi$ the perturbing field acquires a perpendicular
component, and a peak at ~6 eV appears in the $\epsilon(\omega)$ function. The
very same peak occurs in the EEL spectrum, confirming that it corresponds to a
mixed excitonic-plasmonic excitation. This mixed behavior is misdescribed by the
TDA both in the absorption and in the EEL spectra. The TDA performs even worse for
a transverse perturbing field ($\phi = 90^\circ$), where the excitations are
confined within the tube radius. The $\epsilon(\omega)$ and
$\epsilon^{-1}(\omega)$ functions become very similar, that is, the system behaves
as an isolated molecule. Thus, as in the case of trans-azobenzene, the
contribution from e-h antipairs cannot be neglected, and the TDA yields an error
in the position of the main peak in the absorption/EEL spectra of ~0.6 eV.

Our results for the CNT show that, in contrast to the common belief, the
excitations appearing in $\Im(\epsilon)$ are not all purely excitonic. When the
excitation is forced to be spatially confined, it acquires a mixed
excitonic-plasmonic character, and the TDA breaks down. The TDA performs well only
for purely excitonic states, which are dominant only when the perturbing field is
polarized along the tube axis. However, in practice it is not possible to
selectively measure only the longitudinal polarization, as CNTs are generally
randomly oriented in a sample. Even in the case of vertically aligned CNTs the
tubes are found to form angles of ~25° [22].

Finally, an important confirmation of the accuracy of the present approach is
given by the position of the plasmon peaks in the EEL spectra. Using the full
solution of the BS equation, the plasmon energy is found to blueshift by ~0.5 eV
when moving from $\phi = 0^\circ$ to $\phi = 90^\circ$. This is in striking
agreement with the 0.5 eV value measured in momentum-dependent EEL experiments
[22]. On the contrary, the TDA causes an energy redshift of the plasmon, in
disagreement with the experimental results.

In summary, we have shown that the TDA breaks down 
in nanoscale systems where dimensionality effects con- 
fine the optical excitation, inducing a mixed excitonic- 
plasmonic behavior. We propose a novel approach to 
solve the BS/TDDFT equations beyond the TDA, keep- 
ing the numerical advantages of a Hermitian formulation. 
This approach successfully explains the experimental fea- 
tures in the optical and EEL spectra of CNTs, and opens 
the way to a truly ab initio approach to linear response 
properties of nanoscale materials. 